Towards a Latvian Valency Lexicon
Abstract
Development of a verb valency lexicon for Latvian has recently begun. The chosen approach combines and supplements the experience gained from similar lexical resources developed for other languages. The paper describes our approach to verb valency annotation: the valency layers (syntactic valency, semantic valency, and selectional restrictions) and the set of semantic roles. The annotation process, which uses an online tool developed for valency annotation, is also briefly described. From the annotated corpus examples, the valency patterns (models) for each verb are generated, considering only the core semantic roles. As a result of this work, summarized information on the valency patterns of each verb will be available, as well as a corpus of annotated examples. Currently, more than 150 verbs (more than 16 000 sentences) have been annotated using data from the Balanced Corpus of Modern Latvian.
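As a rough illustration of the pattern-generation step mentioned in the abstract, the following sketch groups annotated examples by verb and keeps only core-role arguments. It is not the authors' implementation: the role inventory (`CORE_ROLES`), the form labels such as `NP.nom`, the record layout, and the function name `valency_patterns` are all assumptions made for the example, since the abstract does not specify them.

```python
from collections import Counter, defaultdict

# Hypothetical core-role inventory; the paper's actual role set is not listed here.
CORE_ROLES = {"Agent", "Theme", "Recipient"}


def valency_patterns(examples, core_roles=CORE_ROLES):
    """Count, for each verb, the distinct patterns of core-role arguments."""
    patterns = defaultdict(Counter)
    for ex in examples:
        # Keep only core roles; sort so that word order in the sentence
        # does not produce separate patterns.
        pattern = tuple(sorted(
            (arg["role"], arg["form"])
            for arg in ex["args"]
            if arg["role"] in core_roles
        ))
        patterns[ex["verb"]][pattern] += 1
    return patterns


# Illustrative annotated examples (invented data, assumed record layout).
examples = [
    {"verb": "dot", "args": [
        {"role": "Agent", "form": "NP.nom"},
        {"role": "Theme", "form": "NP.acc"},
        {"role": "Recipient", "form": "NP.dat"},
        {"role": "Time", "form": "Adv"},  # non-core, ignored in the pattern
    ]},
    {"verb": "dot", "args": [
        {"role": "Recipient", "form": "NP.dat"},
        {"role": "Agent", "form": "NP.nom"},
        {"role": "Theme", "form": "NP.acc"},
    ]},
    {"verb": "dot", "args": [
        {"role": "Agent", "form": "NP.nom"},
        {"role": "Theme", "form": "NP.acc"},
    ]},
]

if __name__ == "__main__":
    for verb, counts in valency_patterns(examples).items():
        for pattern, n in counts.most_common():
            print(verb, pattern, n)
```

In this sketch the first two examples collapse into one pattern because the non-core `Time` argument is dropped and argument order is normalized. Whether the actual lexicon normalizes order or counts frequencies this way is not stated in the abstract.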
Similar resources
A Syntactic Valency Lexicon for Persian Verbs: The First Steps towards Persian Dependency Treebank
Valency lexicons are valuable resources for natural language processing. The need for new resources for languages encourages researchers to collect new datasets. One of the most important datasets is valency lexicons. In valency lexicons, information about obligatory and optional complements of words is annotated at the syntactic and semantic levels. In this paper, we report the development of ...
Valency Lexicon of Czech Verbs: Towards Formal Description of Valency and Its Modeling in an Electronic Language Resource
Valency refers to the capacity of a verb (or a word belonging to another part of speech) to take a specific number and type of syntactically dependent language units. Valency information is thus related to particular lexemes, and as such it is necessary to describe valency characteristics for separate lexemes in the form of lexicon entries. A valency lexicon is indispensable for any complex Natura...
Towards a Corpus-based Valency Lexicon of Czech Nouns
Corpus-based Valency Lexicon of Czech Nouns is a starting project picking up the threads of our previous work on nominal valency. It builds upon solid theoretical foundations of the theory of valency developed within the Functional Generative Description. In this paper, we describe the ways of treating valency of nouns in a modern corpus-based lexicon, available as machine readable data in a fo...
Towards English-Czech Parallel Valency Lexicon via Treebank Examples
The paper describes an ongoing project of building a bilingual valency lexicon in the framework of Functional Generative Description. The bilingual lexicon is designed as a result of interlinking frames and frame elements of two already existing valency lexicons. First, we give an overall account of the character of the lexicons to be linked, second, the process of frame linking is explained, a...
Incorporation of a Valency Lexicon into a TectoMT Pipeline
In this paper, we focus on the incorporation of a valency lexicon into the TectoMT system for the Czech-Russian language pair. We demonstrate valency errors in MT output and describe how the introduction of a lexicon influenced the translation results. Though there was no impact on the BLEU score, manual inspection of concrete cases showed some improvement.